Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 844338 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 161.0 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 9 |
| DateTime | 2 |
df_index is highly correlated with store | High correlation |
store is highly correlated with df_index | High correlation |
competition_open_since_year is highly correlated with competition_time_month | High correlation |
day_of_week is highly correlated with dayofweek | High correlation |
month is highly correlated with weekofyear | High correlation |
weekofyear is highly correlated with month | High correlation |
dayofweek is highly correlated with day_of_week | High correlation |
competition_time_month is highly correlated with competition_open_since_year | High correlation |
df_index has unique values | Unique |
dayofweek has 137557 (16.3%) zeros | Zeros |
competition_time_month has 268025 (31.7%) zeros | Zeros |
promo_time_week has 421646 (49.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-08-08 14:49:50.011949 |
|---|---|
| Analysis finished | 2021-08-08 14:53:51.891863 |
| Duration | 4 minutes and 1.88 second |
| Software version | pandas-profiling v2.12.0 |
| Download configuration | config.yaml |
| Distinct | 844338 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 508596.3788 |
| Minimum | 0 |
|---|---|
| Maximum | 1017207 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 51060.85 |
| Q1 | 254520.25 |
| median | 508254.5 |
| Q3 | 762678.75 |
| 95-th percentile | 966495.3 |
| Maximum | 1017207 |
| Range | 1017207 |
| Interquartile range (IQR) | 508158.5 |
Descriptive statistics
| Standard deviation | 293481.2635 |
|---|---|
| Coefficient of variation (CV) | 0.5770415907 |
| Kurtosis | -1.198309067 |
| Mean | 508596.3788 |
| Median Absolute Deviation (MAD) | 254062.5 |
| Skewness | 0.001379351741 |
| Sum | 4.294272493 × 1011 |
| Variance | 8.6131252 × 1010 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 292388 | 1 | < 0.1% |
| 366104 | 1 | < 0.1% |
| 362010 | 1 | < 0.1% |
| 364059 | 1 | < 0.1% |
| 374300 | 1 | < 0.1% |
| 376349 | 1 | < 0.1% |
| 370206 | 1 | < 0.1% |
| 284192 | 1 | < 0.1% |
| 286241 | 1 | < 0.1% |
| Other values (844328) | 844328 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 |
| Value | Count | Frequency (%) |
| 1017207 | 1 | |
| 1017206 | 1 | |
| 1017205 | 1 | |
| 1017204 | 1 | |
| 1017202 | 1 |
| Distinct | 1115 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 558.4213739 |
| Minimum | 1 |
|---|---|
| Maximum | 1115 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 280 |
| median | 558 |
| Q3 | 837 |
| 95-th percentile | 1060 |
| Maximum | 1115 |
| Range | 1114 |
| Interquartile range (IQR) | 557 |
Descriptive statistics
| Standard deviation | 321.7308614 |
|---|---|
| Coefficient of variation (CV) | 0.5761435297 |
| Kurtosis | -1.198836421 |
| Mean | 558.4213739 |
| Median Absolute Deviation (MAD) | 278 |
| Skewness | 0.0004258853753 |
| Sum | 471496386 |
| Variance | 103510.7472 |
| Monotocity | Increasing |
| Value | Count | Frequency (%) |
| 335 | 942 | 0.1% |
| 733 | 942 | 0.1% |
| 682 | 942 | 0.1% |
| 562 | 942 | 0.1% |
| 1097 | 942 | 0.1% |
| 262 | 942 | 0.1% |
| 769 | 942 | 0.1% |
| 494 | 942 | 0.1% |
| 423 | 942 | 0.1% |
| 85 | 942 | 0.1% |
| Other values (1105) | 834918 |
| Value | Count | Frequency (%) |
| 1 | 781 | |
| 2 | 784 | |
| 3 | 779 | |
| 4 | 784 | |
| 5 | 779 |
| Value | Count | Frequency (%) |
| 1115 | 781 | |
| 1114 | 784 | |
| 1113 | 784 | |
| 1112 | 779 | |
| 1111 | 779 |
store_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| a | |
|---|---|
| d | |
| c | |
| b | 15560 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 844338 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | c |
|---|---|
| 2nd row | c |
| 3rd row | c |
| 4th row | c |
| 5th row | c |
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 844338 |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 844338 |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 844338 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 457042 | |
| d | 258768 | |
| c | 112968 | 13.4% |
| b | 15560 | 1.8% |
assortment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| basic | |
|---|---|
| extended | |
| extra | 8209 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.390156549 |
| Min length | 5 |
Characters and Unicode
| Total characters | 5395452 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | basic |
|---|---|
| 2nd row | basic |
| 3rd row | basic |
| 4th row | basic |
| 5th row | basic |
| Value | Count | Frequency (%) |
| basic | 444875 | |
| extended | 391254 | |
| extra | 8209 | 1.0% |
| Value | Count | Frequency (%) |
| basic | 444875 | |
| extended | 391254 | |
| extra | 8209 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| b | 444875 | 8.2% |
| s | 444875 | 8.2% |
| i | 444875 | 8.2% |
| c | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5395452 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| b | 444875 | 8.2% |
| s | 444875 | 8.2% |
| i | 444875 | 8.2% |
| c | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5395452 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| b | 444875 | 8.2% |
| s | 444875 | 8.2% |
| i | 444875 | 8.2% |
| c | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5395452 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 1181971 | |
| d | 782508 | |
| a | 453084 | 8.4% |
| b | 444875 | 8.2% |
| s | 444875 | 8.2% |
| i | 444875 | 8.2% |
| c | 444875 | 8.2% |
| x | 399463 | 7.4% |
| t | 399463 | 7.4% |
| n | 391254 | 7.3% |
competition_distance
Real number (ℝ≥0)
| Distinct | 655 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5961.827515 |
| Minimum | 20 |
|---|---|
| Maximum | 200000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 130 |
| Q1 | 710 |
| median | 2330 |
| Q3 | 6910 |
| 95-th percentile | 20930 |
| Maximum | 200000 |
| Range | 199980 |
| Interquartile range (IQR) | 6200 |
Descriptive statistics
| Standard deviation | 12592.18111 |
|---|---|
| Coefficient of variation (CV) | 2.112134421 |
| Kurtosis | 145.2886585 |
| Mean | 5961.827515 |
| Median Absolute Deviation (MAD) | 1980 |
| Skewness | 10.13490772 |
| Sum | 5033797520 |
| Variance | 158563025 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 250 | 9210 | 1.1% |
| 50 | 6249 | 0.7% |
| 350 | 6239 | 0.7% |
| 1200 | 6069 | 0.7% |
| 190 | 6066 | 0.7% |
| 90 | 5607 | 0.7% |
| 180 | 5421 | 0.6% |
| 330 | 5294 | 0.6% |
| 150 | 5292 | 0.6% |
| 140 | 4684 | 0.6% |
| Other values (645) | 784207 |
| Value | Count | Frequency (%) |
| 20 | 779 | 0.1% |
| 30 | 3115 | |
| 40 | 3888 | |
| 50 | 6249 | |
| 60 | 2342 | 0.3% |
| Value | Count | Frequency (%) |
| 200000 | 2186 | |
| 75860 | 887 | |
| 58260 | 885 | |
| 48330 | 784 | 0.1% |
| 46590 | 784 | 0.1% |
competition_open_since_month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.787355301 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.309916685 |
|---|---|
| Coefficient of variation (CV) | 0.4876592632 |
| Kurtosis | -1.231875281 |
| Mean | 6.787355301 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.04845105686 |
| Sum | 5730822 |
| Variance | 10.95554846 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 112179 | |
| 4 | 98204 | |
| 11 | 86359 | |
| 3 | 80052 | |
| 7 | 76226 | |
| 12 | 63968 | |
| 6 | 63913 | |
| 10 | 63216 | |
| 5 | 58271 | |
| 2 | 56895 | |
| Other values (2) | 85055 |
| Value | Count | Frequency (%) |
| 1 | 37733 | 4.5% |
| 2 | 56895 | |
| 3 | 80052 | |
| 4 | 98204 | |
| 5 | 58271 |
| Value | Count | Frequency (%) |
| 12 | 63968 | |
| 11 | 86359 | |
| 10 | 63216 | |
| 9 | 112179 | |
| 8 | 47322 |
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2010.331102 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 2002 |
| Q1 | 2008 |
| median | 2012 |
| Q3 | 2014 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.502627816 |
|---|---|
| Coefficient of variation (CV) | 0.002737174891 |
| Kurtosis | 123.9030779 |
| Mean | 2010.331102 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -7.217322842 |
| Sum | 1697398942 |
| Variance | 30.27891288 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2013 | 170465 | |
| 2014 | 151774 | |
| 2015 | 91118 | |
| 2012 | 61716 | 7.3% |
| 2005 | 46703 | 5.5% |
| 2010 | 42715 | 5.1% |
| 2011 | 41363 | 4.9% |
| 2009 | 40711 | 4.8% |
| 2008 | 40195 | 4.8% |
| 2007 | 36125 | 4.3% |
| Other values (13) | 121453 |
| Value | Count | Frequency (%) |
| 1900 | 622 | 0.1% |
| 1961 | 779 | 0.1% |
| 1990 | 3885 | |
| 1994 | 1552 | 0.2% |
| 1995 | 1404 | 0.2% |
| Value | Count | Frequency (%) |
| 2015 | 91118 | |
| 2014 | 151774 | |
| 2013 | 170465 | |
| 2012 | 61716 | 7.3% |
| 2011 | 41363 | 4.9% |
promo2
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 844338 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 844338 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 844338 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 844338 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 423292 | |
| 1 | 421046 |
promo2_since_week
Real number (ℝ≥0)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.62908338 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 12 |
| median | 22 |
| Q3 | 37 |
| 95-th percentile | 47 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.28831488 |
|---|---|
| Coefficient of variation (CV) | 0.6046918813 |
| Kurtosis | -1.194814545 |
| Mean | 23.62908338 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.1703986475 |
| Sum | 19950933 |
| Variance | 204.1559421 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 69320 | 8.2% |
| 40 | 56919 | 6.7% |
| 31 | 42369 | 5.0% |
| 10 | 42002 | 5.0% |
| 5 | 39506 | 4.7% |
| 1 | 34479 | 4.1% |
| 13 | 33878 | 4.0% |
| 37 | 33528 | 4.0% |
| 22 | 32208 | 3.8% |
| 18 | 30709 | 3.6% |
| Other values (42) | 429420 |
| Value | Count | Frequency (%) |
| 1 | 34479 | |
| 2 | 9644 | 1.1% |
| 3 | 9784 | 1.2% |
| 4 | 9778 | 1.2% |
| 5 | 39506 |
| Value | Count | Frequency (%) |
| 52 | 4342 | 0.5% |
| 51 | 6424 | |
| 50 | 7188 | |
| 49 | 7030 | |
| 48 | 13442 |
promo2_since_year
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2012.797915 |
| Minimum | 2009 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 2009 |
|---|---|
| 5-th percentile | 2009 |
| Q1 | 2012 |
| median | 2013 |
| Q3 | 2014 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.660124714 |
|---|---|
| Coefficient of variation (CV) | 0.0008247845956 |
| Kurtosis | -0.1979112532 |
| Mean | 2012.797915 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.788295691 |
| Sum | 1699481766 |
| Variance | 2.756014067 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2013 | 257197 | |
| 2014 | 227630 | |
| 2015 | 103528 | |
| 2011 | 95035 | 11.3% |
| 2012 | 60712 | 7.2% |
| 2009 | 53824 | 6.4% |
| 2010 | 46412 | 5.5% |
| Value | Count | Frequency (%) |
| 2009 | 53824 | 6.4% |
| 2010 | 46412 | 5.5% |
| 2011 | 95035 | 11.3% |
| 2012 | 60712 | 7.2% |
| 2013 | 257197 |
| Value | Count | Frequency (%) |
| 2015 | 103528 | |
| 2014 | 227630 | |
| 2013 | 257197 | |
| 2012 | 60712 | 7.2% |
| 2011 | 95035 | 11.3% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.52034967 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.723712365 |
|---|---|
| Coefficient of variation (CV) | 0.4896423725 |
| Kurtosis | -1.259347431 |
| Mean | 3.52034967 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.0193099865 |
| Sum | 2972365 |
| Variance | 2.971184316 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 144052 | |
| 2 | 143955 | |
| 3 | 141922 | |
| 5 | 138633 | |
| 1 | 137557 | |
| 4 | 134626 | |
| 7 | 3593 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 137557 | |
| 2 | 143955 | |
| 3 | 141922 | |
| 4 | 134626 | |
| 5 | 138633 |
| Value | Count | Frequency (%) |
| 7 | 3593 | 0.4% |
| 6 | 144052 | |
| 5 | 138633 | |
| 4 | 134626 | |
| 3 | 141922 |
date
Date
| Distinct | 942 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| Minimum | 2013-01-01 00:00:00 |
|---|---|
| Maximum | 2015-07-31 00:00:00 |
sales
Real number (ℝ≥0)
| Distinct | 21733 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6955.959134 |
| Minimum | 46 |
|---|---|
| Maximum | 41551 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 46 |
|---|---|
| 5-th percentile | 3174 |
| Q1 | 4859 |
| median | 6369 |
| Q3 | 8360 |
| 95-th percentile | 12668 |
| Maximum | 41551 |
| Range | 41505 |
| Interquartile range (IQR) | 3501 |
Descriptive statistics
| Standard deviation | 3103.815515 |
|---|---|
| Coefficient of variation (CV) | 0.4462095673 |
| Kurtosis | 4.854026586 |
| Mean | 6955.959134 |
| Median Absolute Deviation (MAD) | 1694 |
| Skewness | 1.594928836 |
| Sum | 5873180623 |
| Variance | 9633670.754 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5674 | 215 | < 0.1% |
| 5558 | 197 | < 0.1% |
| 5483 | 196 | < 0.1% |
| 6214 | 195 | < 0.1% |
| 6049 | 195 | < 0.1% |
| 5723 | 194 | < 0.1% |
| 5449 | 192 | < 0.1% |
| 5489 | 191 | < 0.1% |
| 5140 | 191 | < 0.1% |
| 5041 | 190 | < 0.1% |
| Other values (21723) | 842382 |
| Value | Count | Frequency (%) |
| 46 | 1 | |
| 124 | 1 | |
| 133 | 1 | |
| 286 | 1 | |
| 297 | 1 |
| Value | Count | Frequency (%) |
| 41551 | 1 | |
| 38722 | 1 | |
| 38484 | 1 | |
| 38367 | 1 | |
| 38037 | 1 |
promo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 844338 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 844338 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 844338 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 844338 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 467463 | |
| 1 | 376875 |
state_holiday
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| regular_day | |
|---|---|
| public_holiday | 694 |
| easter_holiday | 145 |
| christmas | 71 |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.00281285 |
| Min length | 9 |
Characters and Unicode
| Total characters | 9290093 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | regular_day |
|---|---|
| 2nd row | regular_day |
| 3rd row | regular_day |
| 4th row | regular_day |
| 5th row | regular_day |
| Value | Count | Frequency (%) |
| regular_day | 843428 | |
| public_holiday | 694 | 0.1% |
| easter_holiday | 145 | < 0.1% |
| christmas | 71 | < 0.1% |
| Value | Count | Frequency (%) |
| regular_day | 843428 | |
| public_holiday | 694 | 0.1% |
| easter_holiday | 145 | < 0.1% |
| christmas | 71 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| _ | 844267 | |
| d | 844267 | |
| y | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| Other values (8) | 4476 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8445826 | |
| Connector Punctuation | 844267 | 9.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| d | 844267 | |
| y | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| h | 910 | < 0.1% |
| Other values (7) | 3566 | < 0.1% |
| Value | Count | Frequency (%) |
| _ | 844267 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8445826 | |
| Common | 844267 | 9.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| d | 844267 | |
| y | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| h | 910 | < 0.1% |
| Other values (7) | 3566 | < 0.1% |
| Value | Count | Frequency (%) |
| _ | 844267 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9290093 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 1687911 | |
| r | 1687072 | |
| l | 844961 | |
| _ | 844267 | |
| d | 844267 | |
| y | 844267 | |
| u | 844122 | |
| e | 843718 | |
| g | 843428 | |
| i | 1604 | < 0.1% |
| Other values (8) | 4476 | < 0.1% |
school_holiday
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 844338 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 844338 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 844338 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 844338 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 680893 | |
| 1 | 163445 | 19.4% |
is_promo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 844338 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 844338 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 844338 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 844338 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 713533 | |
| 1 | 130805 | 15.5% |
year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| 2013 | |
|---|---|
| 2014 | |
| 2015 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3377352 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
| Value | Count | Frequency (%) |
| 2013 | 337924 | |
| 2014 | 310385 | |
| 2015 | 196029 |
| Value | Count | Frequency (%) |
| 2013 | 337924 | |
| 2014 | 310385 | |
| 2015 | 196029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3377352 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3377352 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3377352 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 844338 | |
| 0 | 844338 | |
| 1 | 844338 | |
| 3 | 337924 | |
| 4 | 310385 | 9.2% |
| 5 | 196029 | 5.8% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.845773849 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.323959483 |
|---|---|
| Coefficient of variation (CV) | 0.5686089762 |
| Kurtosis | -1.033189967 |
| Mean | 5.845773849 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.2577064283 |
| Sum | 4935809 |
| Variance | 11.04870665 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 86335 | |
| 3 | 85975 | |
| 7 | 85576 | |
| 6 | 82571 | |
| 4 | 81726 | |
| 2 | 80239 | |
| 5 | 80099 | |
| 8 | 54411 | |
| 10 | 53291 | |
| 9 | 52321 | |
| Other values (2) | 101794 |
| Value | Count | Frequency (%) |
| 1 | 86335 | |
| 2 | 80239 | |
| 3 | 85975 | |
| 4 | 81726 | |
| 5 | 80099 |
| Value | Count | Frequency (%) |
| 12 | 50393 | |
| 11 | 51401 | |
| 10 | 53291 | |
| 9 | 52321 | |
| 8 | 54411 |
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.64694589 |
| Minimum | 1 |
|---|---|
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 23 |
| Q3 | 35 |
| 95-th percentile | 49 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 14.3899309 |
|---|---|
| Coefficient of variation (CV) | 0.6085323225 |
| Kurtosis | -1.02576046 |
| Mean | 23.64694589 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.2622868966 |
| Sum | 19966015 |
| Variance | 207.0701114 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 20119 | 2.4% |
| 12 | 20098 | 2.4% |
| 9 | 20093 | 2.4% |
| 11 | 20079 | 2.4% |
| 6 | 20066 | 2.4% |
| 5 | 20063 | 2.4% |
| 8 | 20053 | 2.4% |
| 10 | 20051 | 2.4% |
| 4 | 20044 | 2.4% |
| 3 | 20040 | 2.4% |
| Other values (42) | 643632 |
| Value | Count | Frequency (%) |
| 1 | 15161 | |
| 2 | 19448 | |
| 3 | 20040 | |
| 4 | 20044 | |
| 5 | 20063 |
| Value | Count | Frequency (%) |
| 52 | 8319 | |
| 51 | 12355 | |
| 50 | 12333 | |
| 49 | 12334 | |
| 48 | 12334 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.52034967 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 137557 |
| Zeros (%) | 16.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.723712365 |
|---|---|
| Coefficient of variation (CV) | 0.683917944 |
| Kurtosis | -1.259347431 |
| Mean | 2.52034967 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.0193099865 |
| Sum | 2128027 |
| Variance | 2.971184316 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 144052 | |
| 1 | 143955 | |
| 2 | 141922 | |
| 4 | 138633 | |
| 0 | 137557 | |
| 3 | 134626 | |
| 6 | 3593 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 137557 | |
| 1 | 143955 | |
| 2 | 141922 | |
| 3 | 134626 | |
| 4 | 138633 |
| Value | Count | Frequency (%) |
| 6 | 3593 | 0.4% |
| 5 | 144052 | |
| 4 | 138633 | |
| 3 | 134626 | |
| 2 | 141922 |
seasons
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| spring | |
|---|---|
| winter | |
| summer | |
| fall |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.627933363 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4751878 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | summer |
|---|---|
| 2nd row | summer |
| 3rd row | summer |
| 4th row | summer |
| 5th row | summer |
| Value | Count | Frequency (%) |
| spring | 246607 | |
| winter | 237969 | |
| summer | 202687 | |
| fall | 157075 |
| Value | Count | Frequency (%) |
| spring | 246607 | |
| winter | 237969 | |
| summer | 202687 | |
| fall | 157075 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 687263 | |
| i | 484576 | |
| n | 484576 | |
| s | 449294 | |
| e | 440656 | |
| m | 405374 | |
| l | 314150 | |
| p | 246607 | 5.2% |
| g | 246607 | 5.2% |
| w | 237969 | 5.0% |
| Other values (4) | 754806 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4751878 |
Most frequent character per category
| Value | Count | Frequency (%) |
| r | 687263 | |
| i | 484576 | |
| n | 484576 | |
| s | 449294 | |
| e | 440656 | |
| m | 405374 | |
| l | 314150 | |
| p | 246607 | 5.2% |
| g | 246607 | 5.2% |
| w | 237969 | 5.0% |
| Other values (4) | 754806 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4751878 |
Most frequent character per script
| Value | Count | Frequency (%) |
| r | 687263 | |
| i | 484576 | |
| n | 484576 | |
| s | 449294 | |
| e | 440656 | |
| m | 405374 | |
| l | 314150 | |
| p | 246607 | 5.2% |
| g | 246607 | 5.2% |
| w | 237969 | 5.0% |
| Other values (4) | 754806 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4751878 |
Most frequent character per block
| Value | Count | Frequency (%) |
| r | 687263 | |
| i | 484576 | |
| n | 484576 | |
| s | 449294 | |
| e | 440656 | |
| m | 405374 | |
| l | 314150 | |
| p | 246607 | 5.2% |
| g | 246607 | 5.2% |
| w | 237969 | 5.0% |
| Other values (4) | 754806 |
| Distinct | 376 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.67967212 |
| Minimum | -32 |
|---|---|
| Maximum | 1407 |
| Zeros | 268025 |
| Zeros (%) | 31.7% |
| Negative | 70101 |
| Negative (%) | 8.3% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | -32 |
|---|---|
| 5-th percentile | -7 |
| Q1 | 0 |
| median | 16 |
| Q3 | 74 |
| 95-th percentile | 145 |
| Maximum | 1407 |
| Range | 1439 |
| Interquartile range (IQR) | 74 |
Descriptive statistics
| Standard deviation | 66.8144125 |
|---|---|
| Coefficient of variation (CV) | 1.60304554 |
| Kurtosis | 126.8558883 |
| Mean | 41.67967212 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 7.338855632 |
| Sum | 35191731 |
| Variance | 4464.165718 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 268025 | |
| 1 | 9476 | 1.1% |
| 7 | 5316 | 0.6% |
| 5 | 5234 | 0.6% |
| 4 | 5232 | 0.6% |
| 6 | 5214 | 0.6% |
| 9 | 5163 | 0.6% |
| 8 | 5147 | 0.6% |
| 10 | 5140 | 0.6% |
| 11 | 5038 | 0.6% |
| Other values (366) | 525353 |
| Value | Count | Frequency (%) |
| -32 | 30 | < 0.1% |
| -31 | 147 | < 0.1% |
| -30 | 323 | |
| -29 | 445 | |
| -28 | 593 |
| Value | Count | Frequency (%) |
| 1407 | 5 | < 0.1% |
| 1406 | 25 | |
| 1405 | 25 | |
| 1404 | 23 | |
| 1403 | 23 |
promo_since
Date
| Distinct | 167 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.4 MiB |
| Minimum | 2009-07-27 00:00:00 |
|---|---|
| Maximum | 2015-07-27 00:00:00 |
| Distinct | 440 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.40069854 |
| Minimum | -126 |
|---|---|
| Maximum | 313 |
| Zeros | 421646 |
| Zeros (%) | 49.9% |
| Negative | 57241 |
| Negative (%) | 6.8% |
| Memory size | 6.4 MiB |
Quantile statistics
| Minimum | -126 |
|---|---|
| 5-th percentile | -19 |
| Q1 | 0 |
| median | 0 |
| Q3 | 109 |
| 95-th percentile | 230 |
| Maximum | 313 |
| Range | 439 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 85.45755889 |
|---|---|
| Coefficient of variation (CV) | 1.570890838 |
| Kurtosis | 0.1129960976 |
| Mean | 54.40069854 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.103383487 |
| Sum | 45932577 |
| Variance | 7302.994372 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 421646 | |
| 52 | 3910 | 0.5% |
| 98 | 1872 | 0.2% |
| 102 | 1847 | 0.2% |
| 97 | 1830 | 0.2% |
| 103 | 1828 | 0.2% |
| 101 | 1778 | 0.2% |
| 99 | 1777 | 0.2% |
| 94 | 1770 | 0.2% |
| 93 | 1764 | 0.2% |
| Other values (430) | 404316 |
| Value | Count | Frequency (%) |
| -126 | 12 | |
| -125 | 18 | |
| -124 | 18 | |
| -123 | 18 | |
| -122 | 18 |
| Value | Count | Frequency (%) |
| 313 | 35 | |
| 312 | 42 | |
| 311 | 42 | |
| 310 | 42 | |
| 309 | 42 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | store | store_type | assortment | competition_distance | competition_open_since_month | competition_open_since_year | promo2 | promo2_since_week | promo2_since_year | day_of_week | date | sales | promo | state_holiday | school_holiday | is_promo | year | month | weekofyear | dayofweek | seasons | competition_time_month | promo_since | promo_time_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 31 | 2015 | 5 | 2015-07-31 | 5263 | 1 | regular_day | 1 | 0 | 2015 | 7 | 31 | 4 | summer | 84 | 2015-07-27 | 0 |
| 1 | 1 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 31 | 2015 | 4 | 2015-07-30 | 5020 | 1 | regular_day | 1 | 0 | 2015 | 7 | 31 | 3 | summer | 84 | 2015-07-27 | 0 |
| 2 | 2 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 31 | 2015 | 3 | 2015-07-29 | 4782 | 1 | regular_day | 1 | 0 | 2015 | 7 | 31 | 2 | summer | 84 | 2015-07-27 | 0 |
| 3 | 3 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 31 | 2015 | 2 | 2015-07-28 | 5011 | 1 | regular_day | 1 | 0 | 2015 | 7 | 31 | 1 | summer | 84 | 2015-07-27 | 0 |
| 4 | 4 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 31 | 2015 | 1 | 2015-07-27 | 6102 | 1 | regular_day | 1 | 0 | 2015 | 7 | 31 | 0 | summer | 84 | 2015-07-27 | 0 |
| 5 | 6 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 30 | 2015 | 6 | 2015-07-25 | 4364 | 0 | regular_day | 0 | 0 | 2015 | 7 | 30 | 5 | summer | 83 | 2015-07-20 | 0 |
| 6 | 7 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 30 | 2015 | 5 | 2015-07-24 | 3706 | 0 | regular_day | 0 | 0 | 2015 | 7 | 30 | 4 | summer | 83 | 2015-07-20 | 0 |
| 7 | 8 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 30 | 2015 | 4 | 2015-07-23 | 3769 | 0 | regular_day | 0 | 0 | 2015 | 7 | 30 | 3 | summer | 83 | 2015-07-20 | 0 |
| 8 | 9 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 30 | 2015 | 3 | 2015-07-22 | 3464 | 0 | regular_day | 0 | 0 | 2015 | 7 | 30 | 2 | summer | 83 | 2015-07-20 | 0 |
| 9 | 10 | 1 | c | basic | 1270.00 | 9 | 2008 | 0 | 30 | 2015 | 2 | 2015-07-21 | 3558 | 0 | regular_day | 0 | 0 | 2015 | 7 | 30 | 1 | summer | 83 | 2015-07-20 | 0 |
Last rows
| df_index | store | store_type | assortment | competition_distance | competition_open_since_month | competition_open_since_year | promo2 | promo2_since_week | promo2_since_year | day_of_week | date | sales | promo | state_holiday | school_holiday | is_promo | year | month | weekofyear | dayofweek | seasons | competition_time_month | promo_since | promo_time_week | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 844328 | 1017197 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 6 | 2013-01-12 | 4497 | 0 | regular_day | 0 | 0 | 2013 | 1 | 2 | 5 | winter | 0 | 2012-05-21 | 33 |
| 844329 | 1017198 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 5 | 2013-01-11 | 5142 | 1 | regular_day | 1 | 0 | 2013 | 1 | 2 | 4 | winter | 0 | 2012-05-21 | 33 |
| 844330 | 1017199 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 4 | 2013-01-10 | 5007 | 1 | regular_day | 1 | 0 | 2013 | 1 | 2 | 3 | winter | 0 | 2012-05-21 | 33 |
| 844331 | 1017200 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 3 | 2013-01-09 | 4649 | 1 | regular_day | 1 | 0 | 2013 | 1 | 2 | 2 | winter | 0 | 2012-05-21 | 33 |
| 844332 | 1017201 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 2 | 2013-01-08 | 5243 | 1 | regular_day | 1 | 0 | 2013 | 1 | 2 | 1 | winter | 0 | 2012-05-21 | 33 |
| 844333 | 1017202 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 1 | 2013-01-07 | 6905 | 1 | regular_day | 1 | 0 | 2013 | 1 | 2 | 0 | winter | 0 | 2012-05-21 | 33 |
| 844334 | 1017204 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 6 | 2013-01-05 | 4771 | 0 | regular_day | 1 | 0 | 2013 | 1 | 1 | 5 | winter | 0 | 2012-05-21 | 32 |
| 844335 | 1017205 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 5 | 2013-01-04 | 4540 | 0 | regular_day | 1 | 0 | 2013 | 1 | 1 | 4 | winter | 0 | 2012-05-21 | 32 |
| 844336 | 1017206 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 4 | 2013-01-03 | 4297 | 0 | regular_day | 1 | 0 | 2013 | 1 | 1 | 3 | winter | 0 | 2012-05-21 | 32 |
| 844337 | 1017207 | 1115 | d | extended | 5350.00 | 1 | 2013 | 1 | 22 | 2012 | 3 | 2013-01-02 | 3697 | 0 | regular_day | 1 | 0 | 2013 | 1 | 1 | 2 | winter | 0 | 2012-05-21 | 32 |